# High-precision Image Classification

PE Core G14 448
Apache-2.0
The Perception Encoder (PE) is a state-of-the-art image and video understanding encoder trained through simple vision-language learning, achieving top performance across various visual tasks.
Text-to-Image
P
facebook
22.83k
14
Centraasia ResNet 50
MIT
A pre-trained model based on the ResNet-50 architecture, specifically fine-tuned for Central Asian food image classification tasks, supporting 34 types of Central Asian food classification.
Image Classification Transformers English
C
Eraly-ml
86
1
Aimv2 Huge Patch14 224
The AIMv2 series are vision models pretrained with multimodal autoregressive objectives, demonstrating excellent performance across multiple benchmarks.
Image Classification
A
apple
54
9
Vit Batik
MIT
This is an image classification model based on Vision Transformer (ViT) and BEiT architectures, specifically designed for recognizing Indonesian batik patterns.
Image Classification Safetensors Other
V
dewanakl
60
1
Mambavision L 1K
Other
The first hybrid computer vision model combining the advantages of Mamba and Transformer, enhancing visual feature modeling capabilities through redesigned Mamba formulation
Image Classification Transformers
M
nvidia
1,542
5
Vitezoa
This is an image classification model generated by HuggingPics, capable of classifying various images such as animals, birds, and flags.
Image Classification Transformers
V
ezoa
15
0
Swinv2 Chaoyang
Apache-2.0
This is a visual image classification model trained on the ImageNet-1k dataset, capable of recognizing various common objects and scenes.
Image Classification Transformers
S
Snarci
14
0
Clip Vit Large Patch14 Finetuned Fruits 360 Vitlarge
High-precision fruit image classification model fine-tuned on the Fruits-360 dataset based on CLIP ViT-Large
Image Classification Transformers
C
AnneMarie1
29
0
Treeclassification
A vision model for image classification, capable of recognizing various common objects and scenes.
Image Classification Transformers
T
OttoYu
22
0
Autotrain Classify 42751109216
This is a binary classification image classification model trained via AutoTrain, demonstrating perfect validation metrics.
Image Classification Transformers
A
vevlins
16
0
Swin Tiny Patch4 Window7 224 Finetuned Eurosat
Apache-2.0
An image classification model based on the Swin Transformer Tiny architecture, fine-tuned on the CIFAR10 dataset with an accuracy of 97.24%
Image Classification Transformers
S
eric1993
16
0
Autotrain Pick A Card 3726099224
This is a multi-category image classification model trained using AutoTrain, achieving an outstanding accuracy of 98.9% on the validation set.
Image Classification Transformers
A
rwcuffney
18
0
Platzi Vit Model Orlando Murcia
Apache-2.0
High-precision image classification model fine-tuned on the beans dataset based on Google's ViT model
Image Classification Transformers
P
platzi
37
0
Autotrain Retrain Db16d58 2983986070
This is a multi-class image classification model trained via AutoTrain, demonstrating perfect classification performance on test datasets
Image Classification Transformers
A
sbrandeis-test-org
18
0
Autotrain Test Auto Nlp 2885884378
This is a binary classification model trained via AutoTrain, specifically designed for image classification tasks.
Image Classification Transformers
A
owsgfwnlgjuz
16
0
Vit Base Patch16 224 In21k Bart Or Homer
Apache-2.0
This is a binary classification model fine-tuned based on the ViT architecture, designed to distinguish images of Bart Simpson and Homer Simpson.
Image Classification Transformers English
V
DunnBC22
22
1
Swin Tiny Patch4 Window7 224 Finetuned Trash Classification
Apache-2.0
A fine-tuned model based on Swin Transformer architecture for garbage classification tasks, achieving 88.27% accuracy
Image Classification Transformers
S
maixbach
22
2
Convnext Tiny 224 Finetuned Eurosat Albumentations
Apache-2.0
A fine-tuned image classification model based on ConvNeXt-Tiny architecture, achieving 98.15% accuracy on the EuroSAT dataset
Image Classification Transformers
C
toshio19910306
18
0
Convnext Tiny 224 Finetuned Eurosat Albumentations
Apache-2.0
This model is a fine-tuned image classification model based on the ConvNeXt-Tiny architecture, trained on the EuroSAT dataset with albumentations for data augmentation, achieving 98.48% accuracy on the evaluation set.
Image Classification Transformers
C
jypasona
18
0
Swin Base Finetuned Cifar100
Apache-2.0
This model is an image classification model fine-tuned on the CIFAR-100 dataset based on the Swin Transformer architecture, achieving an accuracy of 92.01%.
Image Classification Transformers
S
MazenAmria
119
1
Convnext Tiny 224 Eurosat
Apache-2.0
This model is a fine-tuned version based on the ConvNeXt-Tiny architecture, specifically designed for image classification tasks, achieving an accuracy of 95.37% on the EuroSAT dataset.
Image Classification Transformers
C
polejowska
13
0
Vit Model Juan Bula
Apache-2.0
An image classification model fine-tuned on the beans dataset based on Google's ViT model, used to identify the health status of bean leaves
Image Classification Transformers
V
JuandaBula
13
0
Vit Base Beans Demo V5
Apache-2.0
An image classification model fine-tuned on the beans dataset based on Google's ViT base model, achieving 98.5% accuracy
Image Classification Transformers
V
amy-why
13
0
Vit Base Patch16 224 Finetuned Imageclassification
Apache-2.0
Image classification model fine-tuned on image folder dataset based on Google's ViT model, achieving 95.02% accuracy
Image Classification Transformers
V
thaonguyen274
13
0
Swin Tiny Patch4 Window7 224 Finetuned Woody LeftGR 130epochs
Apache-2.0
Image classification model based on Swin Transformer Tiny architecture, fine-tuned for 130 epochs on a specific image dataset
Image Classification Transformers
S
Alex-VisTas
12
0
Levit 192 Finetuned On Unlabelled IA With Snorkel Labels
Apache-2.0
This model is a fine-tuned version of facebook/levit-192 on an unlabeled dataset, demonstrating excellent performance in precision, recall, F1 score, and accuracy.
Image Classification Transformers
L
ImageIN
19
0
Vit Classification Huggingface
A classification model based on Hugging Face Vision Transformer for the Animal-10 dataset, achieving an accuracy of 98.09%
Image Classification Transformers
V
pytholic
14
0
Swin Tiny Patch4 Window7 224 Finetuned Eurosat
Apache-2.0
A fine-tuned image classification model based on Swin Transformer architecture, achieving 95.17% accuracy on image classification tasks
Image Classification Transformers
S
ezzouhri
13
0
Vit Base Patch16 384 Wi5
Apache-2.0
This model is a fine-tuned Vision Transformer based on google/vit-base-patch16-384, primarily used for image classification tasks.
Image Classification Transformers
V
Imene
21
0
Vit Base Patch16 384 Wi3
Apache-2.0
Fine-tuned model based on Google Vision Transformer (ViT) architecture, suitable for image classification tasks
Image Classification Transformers
V
Imene
21
0
Resnet 50 Finetuned Resnet50 0831
Apache-2.0
An image classification model fine-tuned on image folder dataset based on Microsoft's ResNet-50 model, achieving 97.64% accuracy
Image Classification Transformers
R
morganchen1007
27
0
Modeversion1 M7 E4
Apache-2.0
This model is an image classification model fine-tuned on the sudo-s/herbier_mesuem7 dataset based on google/vit-base-patch16-224-in21k, achieving an evaluation accuracy of 97.31%.
Image Classification Transformers
M
sudo-s
28
0
Check Gum Teeth
This is an image classification model built with PyTorch and HuggingPics, specifically designed for detecting gum health conditions.
Image Classification Transformers
C
steven123
27
0
Convnext Tiny 224 Finetuned Eurosat Albumentations
Apache-2.0
A fine-tuned model based on the ConvNeXt-Tiny architecture, optimized for image classification tasks, excelling on the EuroSAT dataset
Image Classification Transformers
C
aihub007
20
0
Swin Tiny Finetuned Dogfood
Apache-2.0
A dog food image classification model fine-tuned based on Swin Transformer Tiny architecture, achieving 98.8% accuracy on the test set
Image Classification Transformers
S
sasha
15
1
Vit Test 1 95
This is an image classification model based on the Vision Transformer architecture, achieving an accuracy of 95.02%.
Image Classification Transformers
V
25khattab
15
0
Swin Tiny Patch4 Window7 224 Finetuned Eurosat
Apache-2.0
This is a vision model based on the Swin Transformer Tiny architecture, fine-tuned on the EuroSAT dataset, primarily used for image classification tasks.
Image Classification Transformers
S
GRANTHE2761
15
0
Swin Tiny Patch4 Window7 224 Finetuned Eurosat
Apache-2.0
This is a tiny model based on the Swin Transformer architecture, specifically designed for image classification tasks and fine-tuned on the EuroSAT dataset.
Image Classification Transformers
S
guhuawuli
14
0
Convnext Large 224
Apache-2.0
ConvNeXT is a pure convolutional model inspired by vision Transformers, trained on the ImageNet-1k dataset at 224x224 resolution.
Image Classification Transformers
C
facebook
740
27
Vit Base Patch16 224 In21k Finetuned Cifar10
Apache-2.0
This is a vision Transformer model based on Google's ViT base model, fine-tuned on the CIFAR10 dataset for image classification tasks.
Image Classification Transformers
V
nielsr
31
2
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase